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Plants having increased yield and method for making the same 

The present invention relates generally to the field of molecular biology and concerns a 
method for increasing plant yield. More specifically, the present invention concerns a method 
5 for increasing plant yield, particularly seed yield, by introducing into a plant a nucleic acid 
encoding a D-type Cyclin-Dependent Kinase (CDKD). The present invention also concerns 
plants produced by the methods according to the invention, which plants have increased yield 
relative to corresponding wild type plants. The invention also concerns constructs useful in the 
methods of the invention. 

10 

The ever-increasing world population and the dwindling supply of arable land available for 
agriculture fuel research towards improving the efficiency of agriculture. Conventional means 
for crop and horticultural improvements utilise selective breeding techniques to identify plants 
having desirable characteristics. However, such selective breeding techniques have several 

15 drawbacks, namely that these techniques are typically labour intensive and result in plants that 
often contain heterogeneous genetic components that may not always result in the desirable 
trait being passed on from parent plants. Advances in molecular biology have allowed 
mankind to modify the germplasm of animals and plants. Genetic engineering of plants entails 
the isolation and manipulation of genetic material (typically in the form of DNA or RNA) and the 

20 subsequent introduction of that genetic material into a plant. Such technology has the capacity 
to deliver crops or plants having various improved economic, agronomic or horticultural traits. 
A trait of particular economic interest is yield. Yield is normally defined as the measurable 
produce of economic value from a crop. This may be defined in terms of quantity and/or 
quality. Yield is directly dependent on several factors, for example, the number and size of the 

25 organs, plant architecture (for example, the number of branches), seed production and more. 
Root development, nutrient uptake and stress tolerance are also important factors in 
determining yield. Crop yield may be increased by optimizing one of the abovementioned 
factors, which may be done by modifying the inherent growth mechanisms of a plant. 

30 The inherent growth mechanisms of a plant reside in a highly ordered sequence of events 
collectively known as the 'cell cycle'. Progression through the cell cycle is fundamental to the 
growth and development of all multicellular organisms and is crucial to cell proliferation. The 
major components of the cell cycle are highly conserved in yeast, mammals, and plants. The 
cell cycle is typically divided into the following sequential phases: GO — G1 - S - G2 - M. DNA 

35 replication or synthesis generally takes place during the S phase ("S" is for DNA synthesis) 
and mitotic segregation of the chromosomes occurs during the M phase (the "M" is for mitosis), 
with intervening gap phases, G1 (during which cells grow before DNA replication) and G2 (a 
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period after DNA replication during which the cell prepares for division). Cell division is 
completed after cytokinesis, the last step of the M phase. Cells that have exited the cell cycle 
and that have become quiescent are said to be in the GO phase. Cells in this phase can be 
stimulated to renter the cell cycle at the G1 phase. The "G" in Gl, G2 and GO stands for "gap". 
5 Completion of the cell cycle process allows each daughter cell during cell division to receive a 
full copy of the parental genome. 

Cell division is controlled by two principal cell cycle events, namely initiation of DNA synthesis 
and initiation of mitosis. Each transition to each of these key events is controlled by a 

10 checkpoint represented by specific protein complexes (involved in DNA replication and 
division). The expression of genes necessary for DNA synthesis at the G1/S boundary is 
regulated by the E2F family of transcription factors in mammals and plant cells (La Thangue, 
1994; Muller et a/., 2001; De Veylder et a/., 2002). Entry into the cell cycle is 
regulated/triggered by an E2F/Rb complex that integrates signals and allows activation of 

15 transcription of cell cycle genes. The transition between the different phases of the cell cycle, 
and therefore progression through the cell cycle, is driven by the formation and activation of 
different heterodimeric serine/threonine protein kinases, generally referred to as cyclin- 
dependent kinases (CDKs). A prerequisite for activity of these kinases is the physical 
association with a specific cyclin, the timing of activation being largely dependent upon cyclin 

20 expression. Cyclin binding induces conformational changes in the N-terminal lobe of the 
associating CDK and contributes to the localisation and substrate specificity of the complex. 
Monomeric CDKs are activated when they are associated with cyclins and thus have a kinase 
activity. Cyclin protein levels fluctuate in the cell cycle and therefore represent a major factor 
in determining timing of CDK activation. The periodic activation of these complexes containing 

25 cyclins and CDK during cell cycle mediates the temporal regulation of cell-cycle transitions 
(checkpoints). Other factors regulating CDK activity include CDK inhibitors (CKIs or ICKs, 
KIPs, CIPs, INKs), CDK activating kinases (CAK), CDK phosphatasea (Cdc25) and CDK 
subunits (CKS) (Mironov etal. 1999; Reed 1996). 

30 In plants, two major classes of CDKs, known as A-type and B-type CDKs, have been studied 
to date. The A-type CDKs regulate both the G1-to-S and G2-to-M transitions, whereas the IB- 
type CDKs seem only to control the G2~to-M checkpoint (Hemerly et a/., 1995; Magyar et a/., 
1997; Porceddu et a/., 2001). in addition, the presence of C-type CDKs and CDK-activating 
kinases (CAKs) has been reported (Magyar et a/., 1997; Umeda et al. s 1998; JoubSs et a/., 

35 2001). Vandepoele et a/., 2002, identified four CAKs by a homology-based annotation 
method. These CAKs were three D type CAKs (Arath; CDKD;1 $ Arath; CDKD;2 and 
Arath; CDKD;3); and one F-type CAK (Arath; CDKF;1). 
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Yamaguchi et aL (PNAS Vol. 100 (13) 8019-8023, 2003) describe the overexpression of rice 
R2 cDNA (which encodes a CAK) in tobacco leaf explants. They reported that transient 
expression of R2 during the first 7 days of culture triggered callus formation in the absence of 
5 cytokinin. Yamaguchi et aL also examined the control of in vitro organogenesis by CDK. 

Fabian-Marwedel et a/., (The Plant Cell, Vol. 14, 197-210, 2002) report that the rice CAK, R2, 
regulates S-phase progression and overall growth rate in suspension cells. 

10 The ability to influence the cell cycle of a plant, and to thereby modify various growth 
characteristics of a plant, would have applications in areas such as crop enhancement, plant 
breeding, in the production of ornamental plants, aboriculture, horticulture, forestry, the 
production of algae for use in bioreactors (for the biotechnological production of substances 
such as pharmaceuticals, antibodies, or vaccines, or for the bioconversion of organic waste) 

15 and other such areas. 

It has now been found that introduction into a plant of a CDKD-encoding nucleic acid gives 
plants having increased yield relative to corresponding wild type plants. Therefore according 
to one embodiment of the present invention there is provided a method for increasing yield in a 
20 plant, comprising introducing into a plant a nucleic acid encoding a CDKD. 

The term "increased yield" as defined herein is taken to mean an increase in any one or more 
of the following, each relative to corresponding wild type plants: (i) increased biomass (weight) 
of one or more parts of a plant, particularly aboveground (harvestable) parts, increased root 

25 biomass or increased biomass of any other harvestable part; (ii) increased seed yield, which 
may result from an increase in the biomass of the seed (seed weight) and which may be an 
increase in the seed weight per plant or on an individual seed basis, and which increase in 
seed weight may be due to altered seed dimensions, such as seed length and/or seed width 
and/or seed area; (iii) increased number of (filled) seeds; (iv) increased seed size, which may 

30 also influence the composition of seeds; (v) increased seed volume, which may also influence 
the composition of seeds; (vi) increased harvest index, which is expressed as a ratio of the 
yield of harvestable parts, such as seeds, over the total biomass; and (vii) increased thousand 
kernel weight (TKW), which is extrapolated from the number of filled seeds counted and their 
total weight. An increased TKW may result from an increased seed size and/or seed density. 

35 

According to a preferred embodiment of the invention, the increase in yield encompasses an 
increase in yield on a seed level as defined in any one or more of (ii) to (vii) above. 
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Taking corn as an example, a yield increase may be manifested as one or more of the 
following: increase in the number of plants per hectare or acre, an increase in the number of 
ears per plant, an increase in the number of rows, number of kernels per row, kernel weight, 
5 thousand kernel weight, ear length/diameter, among others. Taking rice as an example, a 
yield increase may be manifested by an increase in one or more of the following: number of 
plants per hectare or acre, number of panicles per plant, number of spikelets per panicle, 
number of flowers per panicle, increase in the seed filling rate, increase in thousand kernel 
weight, among others. An increase in yield may also result in modified architecture, or may 
10 occur as a result of modified architecture. 

According to a preferred feature, performance of the methods of the invention result in plants 
having increased yield which is manifested by at least one of: increased aboveground area, 
increased TKW 3 increased number of filled seeds, increased seed weight and increased 
15 harvest index, each relative to control or corresponding wild-type plants. 

Performance of the methods of the invention advantageously leads to increased yield in any 
plant. 

20 The term "plant" as used herein encompasses whole plants, ancestors and progeny of the 
plants and plant parts, including seeds, shoots, stems, leaves, roots (including tubers), and 
plant cells, tissues and organs, wherein each of the aforementioned comprise the gene of 
interest. The term "plant" also encompasses embryos, meristematic regions, garnetophytes, 
sporophytes, pollen, and microspores, again wherein each of the aforementioned comprise the 

25 gene of interest. The term plant, as defined herein, does not include suspension cultures and 
callus tissue. 

The methods of the invention may be performed on any plant, particularly all plants which 
belong to* the superfamily Viridiplantae, in particular monocotyledonous and dicotyledonous 

30 plants including fodder or forage legumes, ornamental plants, food crops, trees or shrubs 
selected from the list comprising Acacia spp.,.Acerspp-, Actinidia spp., Aesculus spp., Agathis 
australis, Albizia amara, Alsophila tricolor, Andropogon spp., Arachis spp, Areca catechu, 
Astelia fragrans, Astragalus cicer, Baikiaea plurijuga, Betula spp., Brassica spp-, Bruguiera 
gymnorrhiza, Burkea africana, Butea frondosa, Cadaba farinosa, Calliandra spp, Camellia 

35 sinensis, Canna indica, Capsicum spp., Cassia spp., Ceniroema pubescens, Cfiaenomeles 
spp., Cinnamomum cassia, Coffea arabica, Coiophospermum mopane, Cor&nillia varia, 
Cotoneaster serotina, Crataegus spp., Cucumis spp., Cupressus spp., Cyath&a dealbata, 

4 
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Cydonia oblonga, Cryptomeria japonica, Cymbopogon spp., Cynthea dealbata, Cydonia 
oblonga, Dalbergia monetaria, Davallia divaricata, Desmodium spp., Dicksonia squarvsa, 
Diheteropogon amplectens, Dioclea spp, Dolichos spp., Dorycnium rectum, Echinachloa 
pyramidalis, Ehrartia spp., Eleusine coracana, Eragrestis spp., Erythrina spp., Eucalyptus spp., 
5 Euclea schimperi, Eulalia villosa, Fagopyrum spp., Feijoa sellowiana, Fragaria spp., Flemingia 
spp, Freycinetia banksii, Geranium thunbergii, Ginkgo biloba, Glycine javanica, GHricidia spp, 
Gossypium hirsutum, Grevillea spp., Guibourtia coleosperma, Hedysarum spp., Hemarthia 
altissima, Heteropogon contortus, Hordeum vulgare, Hyparrhenia rufa, Hypericum er&ctum, 
Hyperthelia dissoluta, Indigo incarnata, Iris spp., Leptarrhena pyrolifolia, Lespediza spp., 

10 Lettuca spp., Leucaena leucocephala, Loudetia simplex, Lotonus bainesii, Lotus spp., 
Macrotyloma axillare, Malus spp., Manihot esculenta, Medicago sativa, Metas&quoia 
glyptostrvboides, Musa sapientum, Nicotianum spp., Onobrychis spp., Ornithopus spp., Oryza 
spp., Peltophorum africanum, Pennisetum spp., Persea gratissima, Petunia spp., Phaseolus 
spp., Phoenix canariensis, Phormium cookianum, Photinia spp., Picea glauca, Pinus spp., 

15 Pisum sativum, Podocarpus totara, Pogonarthria fleckii, Pogonarthria squarrosa, Populus spp., 
Prosopis cineraria, Pseudotsuga menziesii, Pterolobium stellatum, Pyrus communis, Quercus 
spp., Rhaphiolepsis umbellata, Rhopalostylis sapida, Rhus natalensis, Ribes gros&ularia, 
Ribes spp., Robinia pseudoacacia, Rosa spp., Rubus spp., Salix spp., Schyzachyrium 
sanguineum, Sciadopitys verticillata, Sequoia sempen/irens, Sequoiadendron giganteum, 

20 Sorghum bicolor, Spinacia spp., Sporoboius fimbriatus, Stiburus alopecuroides, Stylosanthos 
humilis, Tadehagi spp, Taxodium distichum, Themeda triandra, Trifolium spp., Triticum spp., 
Tsuga heterophylla, Vaccinium spp., Vicia spp., Vitis vinifera, Watsonia pyramidata, 
Zantedeschia aethiopica, Zea mays, amaranth, artichoke, asparagus, broccoli, Brussels 
sprouts, cabbage, canola, carrot, cauliflower, celery, collard greens, flax, kale, lentil, oilseed 

25 rape, okra, onion, potato, rice, soybean, strawberry, sugar beet, sugar cane, sunflower, 
tomato, squash, tea and algae, amongst others. According to a preferred embodiment of the 
present invention, the plant is a crop plant such as soybean, sunflower, canola, alfalfa, 
rapeseed, cotton, tomato, potato or tobacco. Further preferably, the plant is a 
monocotyledonous plant, such as sugar cane. More preferably the plant is a cereal, such as 

30 rice, maize, wheat, barley, millet, rye, sorghum or oats. 

The terms "D-type CDK" or "CDKD" are used interchangeably herein and refer to any amino 
acid sequence which, when used in the construction of a CDK phylogenetic tree, such as the 
one depicted in Fig. 1 , clusters around or in a group which includes D-type CDKs, but no other 
35 CDK types, such as A-, B-, C-, E- or F-type CDKs. Reference herein to a nucleic acid 
encoding a CDKD is to a nucleic acid encoding a CDKD amino acid as defined above. 
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A person skilled in the art could readily determine whether any amino acid sequence in 
question falls within the aforementioned definition using known techniques and software for the 
making of such a phylogenetic tree, such as a GCG, EBl or CLUSTAL package, using default 
parameters. Upon construction of such a phylogenetic tree, sequences clustering around or in 
5 the D-type CDK group will be considered to fall within the definition of a "D-type CDK". Nucleic 
acids encoding such sequences will be useful in performing the methods of the invention. 

D-type CDKs typically have the ability to phosphorylate and activate CDKs and have also been 
shown to phosphorylate and activate RNA polymerase II. D-type CDKs may also exhibit one 
10 or more and preferably all of the following features: (i) an NXTALRE motif, where X is any 
amino acid; (ii) a catalytic kinase domain; and (iii) the ability to bind to cyclin H. 

A CDKD may easily be distinguished from any other CDK since the motif NXTALRE is 
particular to this type of CDK (according to current knowledge). In contrast, according to 
15 current knowledge, an A-type CDK will have a PSTAIRE motif; a B-type CDK a 
P(P/S)T(A/T)(L/M)RE motif; a C-type CDK a PITAIRE motif; an E-type CDK will have an 
SPTARE motif; and an F-type CDK will have a XSAXRE motif. 

A person skilled in the art may readily assay for kinase activity on, for example, purified 
20 substrates such as human CAK2 or on the Arabidopsis thaliana RNA polymerase II carboxy- 
terminus. The ability of a CDKD to bind to cyclin H may readily be determined by co- 
precipitation of CDKD-cyclin H complexes from purified CDKD and cyclin H, or by using a two 
hybrid assay. 

25 In Arabidopsis thaliana, CDKDs are encoded by 3 different genes, CDKD;1, CDKD;2 and 
CDKD;3, each gene encoding a protein which comprises the motif NXTALRE, wherein X is any 
amino acid. 

Advantageously, the methods of the invention may be performed using any nucleic acid 
30 encoding a CDKD as defined hereinabove. Introduction into a plant of a CDKD-encoding 

nucleic acid gives modulated expression (preferably increased expression) in a plant of such a 

nucleic acid and/or modulated (preferably increased) activity and/or levels in a plant of a CDKD 

polypeptide. The activity of a CDKD may be increased by increasing levels of the polypeptide. 

Alternatively, activity may be increased when there is no change in levels of a CDKD 
35 polypeptide, or even when there is a reduction in levels of a CDKD polypeptide. This may 

occur when the intrinsic properties of the polypeptide are altered, for example, by making 

mutant versions that are more active than the wild type polypeptide. 

6 
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The nucleic acid encoding a CDKD is preferably operably linked to a constitutive promoter for 
overexpression in a plant. The constitutive promoter is prefer-ably a GOS2 promoter, further 
5 preferably a GOS2 promoter from rice. It should be clear that the applicability of the present 
invention is not restricted to use of a CDKD from Arabidopsis thaliana, nor to a CDKD 
represented by SEQ ID NO: 1, nor is the applicability of the invention restricted to expression 
of a CDKD-encoding nucleic acid when driven by a GOS2 promoter. 

According to a preferred aspect of the present invention, enhanced or increased expression of 
the CDKD nucleic acid is envisaged. Methods for obtaining enhanced or increased expression 
of genes or gene products are well documented in the art and include, for example, 
overexpression driven by a strong promoter, the use of transcription enhancers or translation 
enhancers. 

The nucleic acid encoding a CDKD may be derived from any source. The nucleic acid/gene 
encoding a CDKD may be isolated from a microbial source, such as bacteria, yeast or fungi, or 
from a plant, algae or animal (including human) source. This nucleic acid may be modified 
from its native form in composition and/or genomic environment through deliberate human 
manipulation. The nucleic acid is preferably a homologous nucleic acid, i.e. a nucleic acid 
obtained from a plant, whether from the same plant species in which it is to be introduced or 
whether from a different plant species. The nucleic acid may be isolated from a 
dicotyledonous species, preferably from the family Brassicaceae, further preferably from 
Arabidopsis thaliana. More preferably, the CDKD-encoding nucleic acid is isolated from 
Arabidopsis thaliana is a CDKD;1, CDKD;2 or a CDKD;3. Most preferably, the CDKD is 
CDKD;1 from Arabidopsis thaliana, particularly the nucleic acid sequence as represented by 
SEQ ID NO: 1 and the corresponding amino acid sequence as M"epresented by SEQ ID NO: 2. 

Advantageously, the* performance of the present invention i& not restricted to the use of a 
>0 CDKD;1 from Arabidopsis as represented by SEQ ID NO: 1. The methods according to the 
present invention may also be practised using functional variants of a CDKD as defined 
hereinabove or using functional variants of CDKD-encoding nucleic acids. Preferred functional 
variants are variants of the nucleic acid sequence represented by SEQ ID NO: 1 or functional 
variants of the amino acid sequence represented by SEQ ID NO: 2. 

15 

The term "functional variant" as defined herein is a variant whi<^h falls under the definition of a 
CDKD as hereinbefore defined. Preferably, the functional variant also has the ability to 

7 
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phosphorylate and activate CDKs and phosphorylate and activate RNA polymerase II. 
Preferably, the D-type CDK functional variant also exhibits one or more and preferably all of 
the following features: (i) an NXTALRE motif, where X is any amino acid; (ii) a catalytic kinase 
domain; and (iii) the ability to bind to cyclin H. A person skilled in the art may also readily 
determine whether a particular variant is functional (in the sense of whether it is able to 
increase plant yield) by simply substituting the sequence described in the Examples section 
below with the variant to be tested for function. 

Suitable variant nucleic acid and amino acid sequences useful in practising the method 
according to the invention, include: 

(i) Functional portions of a CDKD-encoding nucleic acid; 

(ii) Sequences capable of hybridising to a CDKD-encoding nucleic acid; 

(iii) Alternative splice variants of a CDKD-encoding nucleic acid; 

(iv) Allelic variants of a CDKD-encoding nucleic acid; and 

(v) Homologues, derivatives and active fragments of a CDKD amino acid. 

Each of the aforementioned variants is a functional variant, as defined hereinbefore. 

It will be apparent to a person skilled in the art that the use of the full-length CDKD-encoding 
DNA sequence would not be a prerequisite to carrying out the methods according to the 
invention. The methods according to the invention may advantageously be practised using 
functional portions of a CDKD-encoding DNA/nucleic acid, preferably functional portions of a 
nucleic acid sequence as represented by SEQ ID NO: 1 A functional portion is a CDKD- 
encoding nucleic acid falling under the definition of a functional variant as defined hereinabove. 
A portion may be prepared, for example, by making one or more deletions to a CDKD- 
encoding nucleic acid, such as the nucleic acid sequence of SEQ ID NO: 1, using techniques 
well known in the art. 

Therefore according to the invention, there is provided, a method for increasing plant yield, in 
particular seed yield, comprising introducing into a plant a portion of a CDKD-encoding nucleic 
acid. 

Another variant is a sequence capable of hybridising to a CDKD-encoding nucleic acid. Such 
hybridising sequences are those falling under the definition of functional variants as defined 
hereinabove. Particularly preferred are sequences capable of hybridising to a CDKD-encoding 
nucleic acid under stringent conditions, especially to a CDKD-encoding nucleic acid as 
represented by SEQ ID NO: 1. 
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The term "hybridisation" as defined herein is a process wherein substantially homologous 
complementary nucleotide sequences anneal to each other. The hybridisation process can 
occur entirely in solution, i.e. both complementary nucleic acids are in solution. Tools in 
molecular biology relying on such a process include the polymerase chain reaction (PCR; and 

5 all methods based thereon), subtractive hybridisation, random primer extension, nuclease S1 
mapping, primer extension, reverse transcription, cDNA synthesis, differential display of RNAs, 
and DNA sequence determination. The hybridisation process can also oocur with one of the 
complementary nucleic acids immobilised to a matrix such as magnetic beads, Sepharose 
beads or any other resin. Tools in molecular biology relying on such a process include the 

10 isolation of poly (A + ) mRNA. The hybridisation process can furthermore occur with one of the 
complementary nucleic acids immobilised to a solid support such as a nitro-cellulose or nylon 
membrane or immobilised by e.g. photolithography to, for example, a siliceous glass support 
(the latter known as nucleic acid arrays or microarrays or as nucleic aci<l chips). Tools in 
molecular biology relying on such a process include RNA and DNA gel blot analysis, colony 

15 hybridisation, plaque hybridisation, in situ hybridisation and microarray hybridisation. In order 
to allow hybridisation to occur, the nucleic acid molecules are generally thermally or chemically 
denatured to melt a double strand into two single strands and/or to remo\re hairpins or other 
secondary structures from single stranded nucleic acids. The stringency of hybridisation is 
influenced by conditions such as temperature, salt concentration and hybridisation buffer 

10 composition. Hybridisation preferably occurs under stringent conditions. Stringent conditions 
are those that (1) employ low ionic strength and high temperature for washing, for example, 
0.5M sodium phosphate buffer pH 7.2, 1mM EDTA pH 8.0 in 7% SDS at either 65°C or 55°C, 
" or (2) employ during hybridization a denaturing agent such as fbrmamide, for example, 50% 
(vol/vol) formamide with 0.1% bovine serum albumin, 0.1% Ficoll, 0.1% 

\5 polyvinylpolypyrrolidone, 0.05 M sodium phosphate buffer at pH 6.5 with 0.75 M NaCI, 0.075 M 
sodium citrate at 42°C. A specific example includes the use of 50% formamide, 5XSSC (0.75 
M NaCI, 0.075 M sodium citrate), 50 mM sodium phosphate (pH 6-8), 0.1% sodium 
pyrophosphate, 5X Denhard's solution, sonicated salmon sperm DNA (50nm/ml), 0.1% SDS 
> and 10% dextran sulfate at 55°C, with washes at 55°C in 0.2XSSC and 0.1 % SDS. A skilled 

•0 person can readily determine and vary the stringency conditions appropriately to obtain a clear 
and detectable hybridization signal. 

Therefore according to the invention, there is provided, a method for increasing plant yield, 
comprising introducing into a plant a sequence capable of hybridising, preferably under 
5 stringent conditions, to a CDKD-encoding nucleic acid. 
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Another variant usefiul in the methods of the invention is an alternative splice variant of a 
CDKD-encoding nucleic acid. Suitable splice variants are those falling under the definition of a 
functional variant as defined hereinabove. The term "alternative splice variant" as used herein 
encompasses variants of a nucleic acid in which selected introns and/or exons have been 
5 excised, replaced or added. Such variants will be ones in which the biological activity of the 
protein remains unaffected, which can be achieved by selectively retaining functional 
segments of the protein. Such splice variants may be found in nature or may be manmade. 
Methods for making such splice variants are well known in the art. Splice variants of SEQ ID 
NO: 1 are particularly preferred for use in the methods according to the invention. 

10 

Therefore, the invention also provides a method for increasing plant yield, comprising 
introducing into a plant an alternative splice variant of a CDKD-encoding nucleic acid. 

Another variant useful in the methods of the invention is an allelic variant of a CDKD-encoding 
15 nucleic acid. Suitable allelic variants are those falling under the definition of a functional variant 
as defined hereinabove. Allelic variants exist in nature and encompassed within the methods 
of the present invention is the use of these natural alleles. Allelic variants encompass Single 
Nucleotide Polymorphisms (SNPs), as well as Small Insertion/Deletion Polymorphisms 
(INDELs). The size of INDELs is usually less than 100 bp. SNPs and INDELs form the largest 
20 set of sequence variants in naturally occurring polymorphic strains of most organisms. Allelic 
variants of SEQ ID NO: 1 are particularly preferred for use in the methods according to the 
invention. 

Therefore, the invention also provides a method for increasing plant yield, comprising 
25 introducing into a plant an allelic variant of a CDKD-encoding nucleic acid. 

Further advantageously, the methods according to the present invention may also be practised 
using homologues, derivatives or active fragments of a CDKD. Nucleic acids encoding 
homologues, derivatives or active fragments of an amino acid, such as the one represented by 
30 SEQ ID NO: 2, may readily be determined using routine techniques well known to persons 
skilled in the art. Such nucleic acids suitable for use in the methods of the invention may 
readily be determined as described hereinbefore. 

"Homologues" of a protein encompass peptides, oligopeptides, polypeptides, proteins and 
35 enzymes having amino acid substitutions, deletions and/or insertions relative to the unmodified 
protein in question and having similar biological and functional activity as the unmodified 
protein from which they are derived. To produce such homologues, amino acids of the protein 
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may be replaced by other amino acids having similar properties (such as similar 
hydrophobicity, hydrophilicity, antigenicity, propensity to form or break a-helical structures or p- 
sheet structures). Conservative substitution tables are well known in the art (see for example 
Creighton (1984) Proteins. W.H. Freeman and Company). 

5 

The homologues useful in the methods according to the invention have in increasing order of 
preference at least 50%, 55%, 60%, 65%, 70%, 75%, 80%, 85%, 90%, 95% sequence identity 
to the amino acid sequence as represented by SEQ ID NO: 2. CDKDs show about 65% 
identity to each other and show less than 40% identity to other CDKs. Therefore a homologue 
10 having at least 50% identity to the CDK as represented by SEQ ID NO: 2 will not encompass 
any other CDK other than a D-type CDK. 

Also encompassed by the term "homologues" are two special forms of homology, which 
include orthologous sequences and paralogous sequences, which encompass evolutionary 
15 concepts used to describe ancestral relationships of genes. The term "paralogous" relates to 
gene-duplications within the genome of a species leading to paralogous genes. The term 
"orthologous" relates to homologous genes in different organisms due to ancestral relationship. 

Othologues in, for example, monocot plant species may easily be found by performing a so- 
20 called reciprocal blast search. This may be done by a first blast involving blasting the 
sequence in question (for example, SEQ ID NO: 1 or SEQ ID NO: 2) against any sequence 
database, such as the publicly available NCBI database which may be found at: 
http://www.ncbi.nlm.nih.gov. If orthologues in rice were sought, the sequence in question 
would be blasted against, for example, the 28,469 full-length cDNA clones from Oryza sativa 
25 Nipponbare available at NCBI. BLASTn may be used when starting from nucleotides or 
TBLASTX when starting from the protein, with standard default values (expectation 10, 
alignment 50). The blast results may be filtered. The full-length sequences of either the 
filtered results or the non-filtered results are then blasted back (second blast) against the 
sequence in question (SEQ ID NO: 1 or 2). The results of the first and second blasts are then 
30 compared. In the case of large families, ClustalW is used followed by a neighbour joining tree 
to help visualize the clustering. 

A homologue may be in the form of a "substitutional variant" of a protein, i.e. where at least 
one residue in an amino acid sequence has been removed and a different residue inserted in 
35 its place. Amino acid substitutions are typically of single residues, but may be clustered 
depending upon functional constraints placed upon the polypeptide; insertions will usually be 
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of the order of about 1 to 10 amino acid residues, and deletions will range from about 1 to 20 
residues. Preferably, amino acid substitutions comprise conservative amino acid substitutions. 

A homologue may also be in the form of an "insertional variant" of a protein, i.e. where one or 
5 more amino acid residues are introduced into a predetermined site in a protein. Insertions may 
comprise amino-terminal and/or carboxy-terminal fusions as well as intra-sequence insertions 
of single or multiple amino acids. Generally, insertions within the amino acid sequence will be 
smaller than amino- or carboxy-terminal fusions, of the order of about 1 to 10 residues- 
Examples of amino- or carboxy-terminal fusion proteins or peptides include the binding domaira 
0 or activation domain of a transcriptional activator as used in the yeast two -hybrid system, 
phage coat proteins, (histidine)6-tag, glutathione S-transferase-tag, protein A, maltose-bindingi 
protein, dihydrofolate reductase, Tag- 100 epitope, c-myc epitope, FLAGO-epitope, lacZ, CMP 
(calmodulin-binding peptide), HA epitope, protein C epitope and VSV epitope. 

5 Homologues in the form of "deletion variants" of a protein are characterised by the removal off 
one or more amino acids from a protein. 

Amino acid variants of a protein may readily be made using peptide synthetic techniques well 
known in the art, such as solid phase peptide synthesis and the like, or by recombinant DNA. 

0 manipulations. Methods for the manipulation of DNA sequences to produce substitution, 
insertion or deletion variants of a protein are well known in the art. For example, techniques 
for making substitution mutations at predetermined sites in DNA are well known to those* 
skilled in the art and include M13 mutagenesis, T7-Gen in vitro mutagenesis (USB, Cleveland, 
OH), QuickChange Site Directed mutagenesis (Stratagene, San Diego, CA), PCR-mediated 

5 site-directed mutagenesis or other site-directed mutagenesis protocols. 

"Derivatives" include peptides, oligopeptides, polypeptides, proteins and enzymes which may- 
comprise substitutions, deletions or additions of naturally and non-naturally occurring amino 
acid residues compared to the amino acid sequence of a naturally-occurring form of the 

0 protein, for example, as presented in SEQ ID NO: 2. "Derivatives" of a protein encompass 
peptides, oligopeptides, polypeptides, proteins and enzymes which may comprise naturally 
occurring altered, glycosylated, acylated or non-naturally occurring amino acid residues 
compared to the amino acid sequence of a naturally-occurring form of the polypeptide. A 
derivative may also comprise one or more non-amino acid substituents compared to the amino 

5 acid sequence from which it is derived, for example a reporter molecule or other ligand, 
covalently or non-covalently bound to the amino acid sequence, such as a reporter molecule 
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which is bound to facilitate its detection, and non-naturally occurring amino acid residues 
relative to the amino acid sequence of a naturally-occurring protein. 

"Active fragments" of a CDKD protein encompasses sufficient amino acid residues to cluster 
5 around or in the D-type CDK group upon construction of a phylogenetic tree, such as the one 
shown in Figure 1. When using fragments in such a phylogenetic tree, like should be 
compared with like, meaning that corresponding fragments of the other CDKs should be used 
to make the tree. 

10 Methods for the search and identification of CDKD homologues would be well within the realm 
of a person skilled in the art. Methods for the alignment of sequences for comparison are well 
known in the art, such methods include GAP, BESTFIT, BLAST, FASTA and TFASTA. GAP 
uses the algorithm of Needleman and Wunsch (J. Mol. Biol. 48: 443-453, 1970) to find the 
alignment of two complete sequences that maximises the number of matches and minimises 

15 the number of gaps. The BLAST algorithm calculates percent sequence identity and performs 
a statistical analysis of the similarity between the two sequences. The software for performing 
BLAST analysis is publicly available through the National Centre for Biotechnology 
Information. Homologues suitable for use in the methods of the invention, i.e. those having at 
least 50% sequence identity to the amino acid sequence represented by SEQ ID NO: 2, may 

20 be identified by taking full length CDK protein sequences and aligning them using the 
ClustalX1.81 software using default parameters. A distance matrix may then be calculated 
from this alignment using BOXSHADE software, again using default parameters. Both 
software programs are publicly available. 

25 Therefore, the invention also provides a method for increasing plant yield, comprising 
introducing into a plant a nucleic acid encoding a homologue, derivative or active fragment of a 
CDKD, such as a CDKD represented by SEQ ID NO: 2, which homologue, derivative or active 
fragment has in increasing order of preference at least 50%, 55%, 60%, 65%, 70%, 75%, 80%, 
85%, 90%, 95% sequence identity to an amino acid sequence rs represented by SEQ ID NO: 

30 2. 

The invention also provides genetic constructs and vectors to facilitate introduction and/or 
expression of the nucleotide sequences useful in the methods according to the invention. 

35 Therefore, there is provided a gene construct comprising: 

(i) a CDKD-encoding nucleic acid, preferably as represented by SEQ ID NO: 1 or a 
functional variant thereof (as defined hereinabove); 
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(ii) one or more control sequences capable of driving expression of the nucleic acid 
sequence of (i); and optionally 

(iii) a transcription termination sequence. 



5 Constructs useful in the methods according to the present invention may be constructed using 
recombinant DNA technology well known to persons skilled in the art. The gene constructs 
may be inserted into vectors, which may be commercially available, suitable for transforming 
into plants and suitable for expression of the gene of interest in the transformed cells. 

L0 Plants are transformed with a vector comprising the sequence of interest (i.e., a nucleic acid as 
represented by SEQ ID NO: 1 or a functional variant thereof (as defined hereinabove)). The 
sequence of interest is operably linked to one or more control sequences (at least to a 
promoter). The terms "regulatory element", "control sequence" and "promoter" are all used 
interchangeably herein and are to be taken in a broad context to refer to regulatory nucleic 

15 acid sequences capable of effecting expression of the sequences to which they are ligated. 
Encompassed by the aforementioned terms are transcriptional regulatory sequences derived 
from a classical eukaryotic genomic gene (including the TATA box which is required for 
accurate transcription initiation, with or without a CCAAT box sequence) and additional 
regulatory elements (i.e. upstream activating sequences, enhancers and silencers) which alter 

10 gene expression in response to developmental and/or external stimuli, or in a tissue-specific 
manner. Also included within the term is a transcriptional regulatory sequence of a classical 
prokaryotic gene, in which case it may include a —35 box sequence and/or -10 box 
transcriptional regulatory sequences. The term "regulatory element" also encompasses a 
synthetic fusion molecule or derivative which confers, activates or enhances expression of a 

15 nucleic acid molecule in a cell, tissue or organ. The term "operably linked" as used herein 
refers to a functional linkage between the promoter sequence and the gene of interest, such 
that the promoter sequence is able to initiate transcription of the gene of interest. 



Preferably, the nucleic acid encoding a CDKD or a functional variant thereof is operably linked 
50 to a constitutive promoter. The term "constitutive promoter" as defined herein refers to a 
promoter that is expressed predominantly in at least one tissue or organ and predominantly at 
any stage in the life cycle of a plant. Preferably, the constitutive promoter is expressed 
predominantly throughout the plant Preferably, the constitutive promoter is the GOS2 
promoter from rice. 

!5 

Examples of other constitutive promoters suitable for use in the methods of the invention are 
listed in Table A below. 
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Table A: Examples of constitutive promoters for use in performance of the invention 



GENE SOURCE 


EXPRESSION 
PATTERN 


REFERENCE 


Actin 


Constitutive 


McEIroy et al, Plant Cell, 2: 163-171, 1990 


CAMV 35S 


Constitutive 


Odell etal Nature 313- 810-812 1985 


CaMV 19S 


Constitutive 


Nilsson ef a/., Physiol. Plant 1 00:456-462, 
1997 


GOS2 


Constitutive 


de Pater ef a/, Plant J Nov;2(6):837-44, 
1992 


Ubiquitin 


Constitutive 


Christensen etal, Plant Mol. Biol. 18: 675- 


rice cyclophilin 


constitutive 


Buchholz etal, Plant Mol Biol. 25(5): 837- 
43, 1994 


maize H3 histone 


Constitutive 


Lepetit ef a/, Mol. Gen. Genet. 231:276- 
285, 1992 


actin 2 


Constitutive 


An etal, Plant J. 10(1); 107-121, 1996 



Optionally, one or more terminator sequences may also be used in the construct introduced 
into a plant. The term "terminator* encompasses a control sequence which is a DNA 
5 sequence at the end of a transcriptional unit which signals 3' processing and polyadenylation 
of a primary transcript and termination of transcription. Additional regulatory elements may 
include transcriptional as well as translational enhancers. Those skilled in the art will be aware 
of terminator and enhancer sequences which may be suitable for use in performing the 
invention. Such sequences would be known or may readily be obtained by a person skilled in 
10 the art. 

The genetic constructs of the invention may further include an origin of replication sequence 
which is required for maintenance and/or replication in a specific cell type. One example is 
when a genetic construct is required to be maintained in a bacterial cell as an episomal genetic 
15 element (e.g. plasmid or cosmid molecule). Preferred origins of replication include, but are not 
limited to, the fl-ori and colE1. 

The genetic construct may optionally comprise a selectable marker gene. As used herein, the 
term "selectable marker gene" includes any gene which confers a phenotype on a cell in which 
20 it is expressed to facilitate the identification and/or selection of cells which are transfected or 
transformed with a nucleic acid construct of the invention. Suitable markers may be selected 
from markers that confer antibiotic or herbicide resistance. Cells containing the recombinant 
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DNA will thus be able to survive in the presence of antibiotic or herbicide concentrations that 
kill untransformed cells. Examples of selectable marker genes include the bar gene which 
provides resistance to the herbicide Basta; the npt gene which confers resistance to the 
antibiotic kanamycin; the hpt gene which confers hygromycin resistance. Visual markers, such 
5 as the Green Fluorescent Protein (GFP, Haseloff et al., 1997), p-giucuronidase (GUS) or 
luciferase may also be used as selectable markers. Further examples of suitable selectable 
marker genes include the ampicillin resistance (Ampr), tetracycline resistance gene (Tcr), 
bacterial kanamycin resistance gene (Kanr), phosphinothricin resistance gene, neomycin 
phosphotransferase gene (nptll), hygromycin resistance gene, gene, and the chloramphenicol 
10 acetyltransferase (CAT) gene, amongst others. 

The present invention also encompasses plants obtainable by the methods according to the 
present invention. The present invention therefore provides plants obtainable by the method 
according to the present invention, which plants have increased yield and which plants have 
15 altered CDKD protein activity and/or levels and/or altered expression of a nucleic acid 
encoding a CDKD protein relative to corresponding wild-type plants. 

The invention also provides a method for the production of transgenic plants having increased 
yield, comprising introduction and expression in a plant of a CDKD-encoding nucleic acid or a 
20 functional variant thereof (as defined hereinabove). 

More specifically, the present invention provides a method for the production of transgenic 
plants having increased yield, which method comprises: 

(i) introducing into a plant or plant cell a CDKD-encoding nucleic acid or a functional 
25 variant thereof (as defined hereinabove); 

(ii) cultivating the plant cell under conditions promoting regeneration and mature plant 
growth. 

The nucleic acid may be introduced directly into a plant cell or into the plant itself (including 
30 introduction into a tissue, organ or any other part of a plant). According to a preferred feature 
of the present invention, the nucleic acid is preferably introduced into a plant by transformation. 

The term transformation" as referred to herein encompasses the transfer of an exogenous 
polynucleotide into a host cell, irrespective of the method used for transfer. Plant tissue 
35 capable of subsequent clonal propagation, whether by organogenesis or embryogenesis, may 
be transformed with a genetic construct of the present invention and a whole plant regenerated 
therefrom. The particular tissue chosen will vary depending on the clonal propagation systems 
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available for, and best suited to, the particular species being transformed. Exemplary tissue 
targets include leaf disks, pollen, embryos, cotyledons, hypocotyls, megagametophytes, callus 
tissue, existing meristematic tissue (e.g., apical meristem, axillary buds, and root meristems), 
and induced meristem tissue (e.g., cotyledon meristem and hypocotyl meristem). The 
5 polynucleotide may be transiently or stably introduced into a host cell and may be maintained 
non-integrated, for example, as a plasmid. Alternatively, it may be integrated into the host 
genome. The resulting transformed plant cell may then be used to regenerate a transformed 
plant in a manner known to persons skilled in the art. 

10 Transformation of plant species is now a fairly routine technique. Advantageously, any of 
several transformation methods may be used to introduce the gene of interest into a suitable 
ancestor cell. Transformation methods include the use of liposomes, electroporation, 
chemicals that increase free DNA uptake, injection of the DNA directly into the plant, particle 
gun bombardment transformation using viruses or pollen and microprojection. Methods may 

15 be selected from the calcium/polyethylene glycol method for protoplasts (Krens, F.A. et al., 
1882, Nature 296, 72-74; Negrutiu I. et aL, June 1987, Plant Mol. Biol. 8, 363-373); 
electroporation of protoplasts (Shillito R.D. et al., 1985 Bio/Technol 3, 1099-1102); 
microinjection into plant material (Crossway A. et al., 1986, Mol. Gen Genet 202, 179-185); 
DNA or RNA-coated particle bombardment (Klein T.M. et al., 1987, Nature 327, 70) infection 

20 with (non-integrative) viruses and the like. Transgenic rice plants expressing a CDKD- 
encoding nucleic acid are preferably produced via Agrobacterium-med'\ated transformation 
using any of the well known methods for rice transformation, such as described in any of the 
following: published "European patent application EP 1198985 A1, Aldemita and Hodges 
(Planta, 199, 612-617, 1996); Chan et al. (Plant Mol. Biol. 22 (3) 491-506, 1993), Hiei et al. 

25 (Plant J. 6 (2) 271-282, 1994), which disclosures are incorporated by reference herein as if 
fully set forth. In the case of corn transformation, the preferred method is as described in 
either Ishida et al. (Nat. Biotechnol. 1996 Jun; 14(6): 745-50) or Frame et al. (Plant Physiol. 
2002 May; 129(1): 13-22), which disclosures are incorporated by reference herein as if fully set 
forth. 

30 

Generally after transformation, plant cells or cell groupings are selected for the presence of 
one or more markers which are encoded by plant-expressible genes co-transferred with the 
gene of interest, following which the transformed material is regenerated into a whole plant. 

35 Following DNA transfer and regeneration, putatively transformed plants may be evaluated, for 
instance using Southern analysis, for the presence of the gene of interest, copy number and/or 
genomic organisation. Alternatively or additionally, expression levels of the newly introduced 
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DNA may be monitored using Northern and/or Western analysis, both techniques being well 
known to persons having ordinary skill in the art. 

The generated transformed plants may be propagated by a variety of means, such as by clonal 
5 propagation or classical breeding techniques. For example, a first generation (or T1) 
transformed plant may be selfed to give homozygous second generation (or T2) transformants, 
and the T2 plants further propagated through classical breeding techniques. 

The generated transformed organisms may take a variety of forms. For example, they may be 
10 chimeras of transformed cells and non-transformed cells; clonal transformants (e.g., all cells 
transformed to contain the expression cassette); grafts of transformed and untransformed 
tissues (e.g., in plants, a transformed rootstock grafted to an untransformed scion). 

The present invention clearly extends to any plant cell or plant produced by any of the methods 
15 described herein, and to all plant parts and propagules thereof. The present invention extends 
further to encompass the progeny of a primary transformed or transfected cell, tissue, organ or 
whole plant that has been produced by any of the aforementioned methods, the only 
requirement being that progeny exhibit the same genotypic and/or phenotypic characteristic(s) 
as those produced in the parent by the methods according to the invention. The invention also 
20 includes host cells containing an isolated CDKD-encoding nucleic acid. Preferred host cells 
according to the invention are plant cells. The invention also extends to harvestable parts of a 
plant such as but not limited to seeds, leaves, fruits, flowers, rhizomes, tubers and bulbs. 

The present invention also encompasses the use of nucleic acids encoding CDKDs and the 
25 use of CDKD polypeptides. 

One such use of course relates to the use of a CDKD in increasing plant yield, in particular in 
increasing seed yield. The seed yield may include one or more of the following: increased 
number of filled seeds, increased seed weight, increased harvest index and increased TKW, 
30 among others. The CDKD may be a nucleic acid as represented by SEQ ID NO: 1 or a 
functional variant thereof as hereinbefore defined; or the CDKD may be an amino acid as 
represented by SEQ ID NO: 2 or a functional variant thereof as hereinbefore defined. 

Nucleic acids encoding CDKDs and CDKD polypeptides may also find use in breeding 
35 programmes. The CDKD may be a nucleic acid as represented by SEQ ID NO: 1 or a 
functional variant thereof as hereinbefore defined; or the CDKD may be an amino acid as 
represented by SEQ ID NO: 2 or a functional variant thereof as hereinbefore defined. For 
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example, the CDKD-encoding nucleic acid or a functional variant thereof may be on a 
chromosome (or a part thereof), preferably together with one or more related family members. 
In an example of such a breeding programme, a DNA marker is identified which may be 
genetically linked to a nucleic acid encoding a CDKD protein or a functional variant thereof. 
5 This DNA marker may then used in breeding programs to select plants having increased yield. 

Allelic variants of a CDKD may also be used in conventional breeding programmes, such as in 
marker-assisted breeding. Such breeding programmes sometimes require the introduction of 
allelic variations in the plants by mutagenic treatment of a plant. One suitable mutagenic 

10 method is EMS mutagenesis. Identification of allelic variants then takes place by, for example, 
PCR. This is followed by a selection step for selection of superior allelic variants of the 
sequence in question and which give increased plant yield. Selection is typically carried out by 
monitoring yield of plants containing different allelic variants of the sequence in question, for 
example, different allelic variants of SEQ ID NO: 1. Monitoring yield can be done in a 

15 greenhouse or in the field. Further optional steps include crossing plants, in which the superior 
allelic variant was identified, with another plant This could be used, for example, to make a 
combination of interesting phenotypic features. 

Nucleic acids encoding CDKDs and CDKD polypeptides may also find use as growth 
20 regulators. The CDKD may be a nucleic acid as represented by SEQ ID NO: 1 or a functional 
variant thereof as hereinbefore defined; or the CDKD may be an amino acid as represented by 
SEQ ID NO: 2 or a functional variant thereof as hereinbefore defined. Since these CDKDs are 
useful in increasing yield of plants, the CDKDs would also be useful growth regulators, such as 
herbicides or growth stimulators. The present invention therefore provides a composition 
25 comprising a CDKD, together with a suitable carrier, diluent or excipient, for use as a growth 
regulator. 

The methods according to the invention may also be performed without introducing a nucleic 
acid encoding a CDKD into a plant. This may be achieved by introducing a genetic 
30 modification (preferably in the locus of a CDKD-encoding gene). The locus of a gene as 
defined herein is taken to mean a genomic region, which includes the gene of interest and 
10KB up- or down stream of the coding region. 

The genetic modification may be introduced, for example, by any one (or more) of the following 
35 methods: TDNA activation, tilling, site-directed mutagenesis, homologous recombination or, as 
discussed hereinabove, by introducing and expressing in a plant (cell) a CDKD-encoding 
nucleic acid. 
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T-DNA activation tagging (Hayashi et ai Science (1992) 1350-1353) involves insertion of T- 
DNA usually containing a promoter {may also be a translation enhancer or an intron), in the 
genomic region of the gene of interest or 10KB up- or down stream of the coding region of a 
5 gene in a configuration such that the promoter directs expression of the targeted gene. 
Typically, regulation of expression of the targeted gene by its natural promoter is disrupted and 
the gene falls under the control of the newly introduced promoter. The promoter is typically 
embedded in a T-DNA. This T-DNA is randomly inserted into the plant genome, for example, 
through Agrobacterium infection and leads to overexpression of genes near to the inserted T- 
10 DNA. The resulting transgenic plants show dominant phenotypes due to overexpression of 
genes close to the introduced promoter. The promoter to be introduced may be any promoter 
capable of directing expression of a gene in the desired organism, in this case a plant. For 
example, constitutive, tissue-preferred, cell type-preferred and inducible promoters are all 
suitable for use in T-DNA activation. 

15 

A genetic modification may also be introduced in the locus of a CDKD-encoding nucleic 
acid/gene using the technique of TILLING (Targeted Induced Local Lesions IN Genomes). 
This is a mutagenesis technology useful to generate and/or identify, and to eventually isolate 
mutagenised variants of a CDKD-encoding nucleic acid capable of exhibiting CDKD biological 

20 activity. TILLING also allows selection of plants carrying such mutant variants. These mutant 
variants may even exhibit higher CDKD activity than that exhibited by the gene in its natural 
form. TILLNG combines high-density mutagenesis with high-throughput screening methods. 
The steps typically followed in TILLING are: (a) EMS mutagenesis (Redei and Koncz, 1992; 
Feldmann et a/., 1994; Lightner and Caspar, 1998); (b) DNA preparation and pooling of 

25 individuals; (c) PCR amplification of a region of interest; (d) denaturation and annealing to 
allow formation of heteroduplexes; (e) DHPLC, where the presence of a heteroduplex in a pool 
is detected as an extra peak in the chromatogram; (f) identification of the mutant individual; 
and (g) sequencing of the mutant PCR product. Methods for TILLING are well known in the art 
(McCallum Nat Biotechnol. 2000 Apr; 18(4):455-7, reviewed by Stemple 2004 (TILLING-a 

30 high-throughput harvest for functional genomics. Nat Rev Genet 2004 Feb;5(2): 145-50.)). 

Site directed mutagenesis may be used to generate variants of CDKD-encoding nucleic acids. 
Several methods are available to achieve site directed mutagenesis, the most common being 
PCR based methods (current protocols in molecular biology. Wiley Eds. 
35 http://www.4ulr.com/products/currentprotocols/index.html). 
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TDNA activation, TILLING and site-directed mutagenesis are examples of technologies that 
enable the generation of novel alleles and CDKD variants. 

Homologous recombination allows introduction in a genome of a selected nucleic acid at a 
5 defined selected position. Homologous recombination is a standard technology used routinely 
in biological sciences for lower organisms such as yeast or the moss physcomitrella. Methods 
for performing homologous recombination in plants have been described not only for model 
plants (Offringa et a/. Extrachromosomal homologous recombination and gene targeting in 
plant cells after Agmbacterium-med\ated transformation. 1990 EM BO J. 1990 Oct; 9(10):3077- 

10 84) but also for crop plants, for example rice (Terada R, Urawa H, Inagaki Y, Tsugane K, lida 
S. Efficient gene targeting by homologous recombination in rice. Nat Biotechnol. 2002. lida and 
Terada: A tale of two integrations, transgene and T-DNA: gene targeting by homologous 
recombination in rice. Curr Opin Biotechnol. 2004 Apr; 15(2): 132-8). The nucleic acid to be 
targeted (which may be a CDKD-encoding nucleic acid or variant thereof as hereinbefore 

15 defined) need not be targeted to the locus of a CDKD-encoding gene, but may be introduced 
in, for example, regions of high expression. The nucleic acid to be targeted may be an 
improved allele used to replace the endogenous gene or may be introduced in addition to the 
endogenous gene. 

20 The methods according to the present invention result in plants having increased yield, as 
described hereinbefore. These advantageous yield effects may also be combined with other 
economically advantageous traits, such as further yield-enhancing traits, tolerance to various 
stresses, traits modifying various architectural features and/or biochemical and/or physiological 
features. 

25 

Description of figures 

The present invention will now be described with reference to the following figures in which: 

Fig. 1 is a tree showing various plant CDKs. Full-length CDK protein sequences were aligned 
30 using the "ClustalX1.81" software with its default parameters. A neighbour-joining tree was 
calculated from this alignment using "ClustalX1.81" with its default parameters. The tree was 
drawn using the "drawgram" program of the "PhylipS.S" package with its default parameters. 

Fig- 2 shows a binary vector for expression in Oryza sativa of the Arahidopsis thaliana 
35 CDKD;1 gene under the control of a GOS2 promoter. 
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Fig. 3 details examples of sequences useful in performing the methods according to the 
present invention. 

Examples 

5 The present invention will now be described with reference to the following examples, which 
are by way of illustration alone. 

DNA manipulation: unless otherwise stated, recombinant DNA techniques are performed 
according to standard protocols described in (Sambrook (2001) Molecular Cloning: a 
10 laboratory manual, 3rd Edition Cold Spring Harbor Laboratory Press, CSH, New York) or in 
Volumes 1 and 2 of Ausubel et aL (1994), Current Protocols in Molecular Biology, Current 
Protocols. Standard materials and methods for plant molecular work are described in Plant 
Molecular Biology Labfase (1993) by R.D.D. Cray, published by BIOS Scientific Publications 
Ltd (UK) and Blackwell Scientific Publications (UK). 

15 

Example 1: Gene Cloning 

The Arabidopsis CDKD1;1 was amplified by PCR using as template an Arabidopsis thaliana 
seedling cDNA library (Invitrogen, Paisley, UK). After reverse transcription of RNA extracted 
from seedlings, the cDNAs were cloned into pCMV Sport 6.0. Average insert size of the bank 

20 was 1 .5 kb, and original number of clones was of 1.59x10 7 cfu. Original titer was determined to 
be 9.6x10 5 cfu/ml, after first amplification of 6x1 0 11 cfu/ml. After plasmid extraction, 200 ng of 
template was used in a 50 pi PCR mix. Primers prm2676 (sense, start codon in bold, AttB1 
site in italic: 5' GGGGACAAGTTTGTACAAAAAAGCAGGCTTCACA 
ATGGAACAG CCG AAG AAAG 3') and prm3677 (reverse, complementary, stop codon in bold, 

25 AttB2 site in italic: 5' GGGGACCACTTTGTACAAGAAAGCTGGGT 

CCTATAGGAACTCGAGATCAAGTT 3'), which include the AttB sites for Gateway 
recombination, were used for PCR amplification. PCR was performed using Hifi Taq DNA 
polymerase in standard conditions. A PCR fragment of 1256 bp was amplified and purified 
also using standard methods. The first step of the Gateway procedure, the BP reaction, was 

30 then performed, during which the PCR fragment recombines in vivo with the pDONR201 
plasmid to produce, according to the Gateway terminology, an "entry clone", p2777. Plasmid 
pDONR201 was purchased from Invitrogen, as part of the Gateway® technology. 

Example 2: Vector Construction 

35 The entry clone p2777 was subsequently used in an LR reaction with p0640, a destination 
vector used for Oryza sativa transformation. This vector contained within the T-DNA borders: 
a plant selectable marker; a screenable marker; and a Gateway cassette intended for LR in 
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vivo recombination with the sequence of interest already cloned in the entry clone. A rice 
GOS2 promoter for constitutive expression was located upstream of the Gateway cassette. 

After the LR recombination step, the resulting expression vector as shown in Figure 2 (CDK 
5 D1;l::GOS2 - upregulation) was transformed into Agrobacterium and subsequently into Oryza 
sativa plants. Transformed rice plants were allowed to grow and were then examined for the 
parameters described in Example 3. 

Example 3: Evaluation and Results 

10 Approximately 15 to 20 independent TO rice transformants were generated. The primary 
transformants were transferred from tissue culture chambers to a greenhouse for growing and 
harvest of T1 seed. 6 events, of which the T1 progeny segregated 3:1 for presence/absence 
of the transgene, were retained- For each of these events, approximately 10 T1 seedlings 
containing the transgene (hetero- and homo-zygotes), and approximately 10 T1 seedlings 

15 lacking the transgene (nullizygotes), were selected by monitoring visual marker expression. 

Statistical analysis: t-test and F-test 

A two factor ANOVA (analysis of variants) was used as a statistical model for the overall 
evaluation of plant phenotypic characteristics. An F-test was carried out on all the parameters 

20 measured of all the plants of all the events transformed with the gene of the present invention. 
The F-test was carried out to check for an effect of the gene over all the transformation events 
and tCL verify for an overall effect of the gene, also known as a globgl gene effect. The 
threshold for significance for a true global gene effect was set at a 5% probability level for the 
F-test. A significant F-test value points to a gene effect, meaning that it is not only the 

25 presence or position of the gene that is causing the differences in phenolype. 

To check for an effect of the genes within an event, i.e., for a line-specific effect, a t-test was 
performed within each event using data sets from the transgenic plants and the corresponding 
null plants. "Null plants" or "Null segregants" are the plants treated in the same way as 

30 transgenic plants, but from which the transgene has segregated. Null plants may also be 
described as homozygous negative transformants. The threshold for significance for the t-test 
was set at a 10% probability level. Within one population of 5 transformation events, some 
events can be under or above this t-test threshold. This is based on the hypothesis that a 
gene might only have an effect in certain positions in the genome, and that the occurrence of 

35 this position-dependent effect is not uncommon. This kind of gene effect is also known as a 
line effect of the gene. The p-value was obtained by comparing the t-value to the t-distribution 
or alternatively, by comparing the F-value to the F-distribution. The p-value then stands for the 
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probability that the null hypothesis (null hypothesis being "there is no effect of the transgene") 
is correct. 

4 A Vegetative growth measurements: 

5 The selected T1 plants (approximately 10 with the transgene and approximately 10 without the 
transgene) were transferred to a greenhouse. Each plant received a unique barcode label to 
link unambiguously the phenotyping data to the corresponding plant. The selected T1 plants 
were grown on soil in 10 cm diameter pots under the following environmental settings: 
photoperiod= 11.5 h, daylight intensity= 30,000 lux or more, daytime temperature= 28°C or 

10 higher, night time temperature= 22°C, relative humidity= 60-70%. Transgenic plants and the 
corresponding nullizygotes were grown side-by-side at random positions. From the stage of 
sowing until the stage of maturity each plant was passed several times through a digital 
imaging cabinet and imaged. At each time point digital images (2048x1536 pixels, 16 million 
colours) were taken of each plant from at least 6 different angles. The parameters described 

15 below were derived in an automated way from all the digital images of all the plants, using 
image analysis software. 

4.1.1 Aboveground plant area 

Plant aboveground area was determined by counting the total number of pixels from 
20 aboveground plant parts discriminated from the background. This value was averaged for the 
pictures taken on the same time point from the different angles and was converted to a 
physical surface value expressed in square mm by calibration. Experiments show that the 
aboveground plant area measured this way correlates with the biomass of plant parts above 
ground. The best 3 lines of the T1 evaluation were then evaluated in the T2 round. The 
25 results of the T2 evaluation are shown in Table 1 below. As shown, one of the lines shows a 
statistically significant increase in aboveground area (with p-vaiue from the t-test of 0.0107) 
compared to corresponding nullizygotes. 



Table 1 




30 



Each row corresponds to one event, for which the aboveground area was determined for the 
transgenics (TR) and the null lines (null), expressed in units. The numeric difference between 
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the positive plants and the negative plants is given (dif) as well as the percentage of difference 
between these plants (% dif). P-value stands for the probability produced by the t-test for each 
plant line. The last row presents the average numbers for ail events. There, the p-value 
stands for the p-value derived from the F-test 

5 4.2 Seed-related parameter measurements 

The mature primary panicles were harvested, bagged, barcode-labelled and then dried for 
three days in the oven at 37°C. The panicles were then threshed and all the seeds were 
collected and counted. The filled husks were separated from the empty ones using an air- 
blowing device. The empty husks were discarded and the remaining fraction was counted 
10 again. The filled husks were weighed on an analytical balance. This procedure resulted in the 
set of seed-related parameters described below. 

4.2.0 Number of filled seeds 

The number of filled seeds was determined by counting the number of filled husks that 
15 remained after the separation step. Again, 3 of the best plants from the T1 evaluation were 
taken to the T2 round. The results of the T2 evaluation are shown in Table 2 below. As 
shown, 2 of the lines showed a significant increase in the number of filled seeds of transgenic 
plants relative to the number of filled seeds of corresponding non-transgenic plants. There 
was also an overall gene effect as concluded by the significant p value from the F-test of 0. 

20 

Table 2 



r Number of Filled Seeds 


{Line ||TR null jjdif 


!!% dif 


jp-vaiue j 


*0010A 1119 |82.3 J36.65 


J|45 


]j0.0588 1 


0009A * 146.6 ; 


18.7 1127.85 


:j684 


WZZ.J 


jOOOTA (1198.6 j 


207.7 j;-9.15" 


jj-4 


fo.7586 1 


'Overall |R 54.7 J 


102.9 |51.78 


j|50 





Each row corresponds to one event, for which the number of filled seeds was determined for 
the transgenics (TR) and the null lines (null), expressed in units. The numeric difference 
25 between the positive plants and the negative plants is given (dif) as well as the percentage of 
difference between these plants (% dif). R-value stands for the probability produced by the t- 
test for each plant line. The last row presents the average numbers for all events. There, the 
p-value stands for the p-value derived from the F-test 
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4.2.1 Total seed yield per plant 

The total seed yield was measured by weighing all filled husks harvested from a plant. Again, 
3 of the best plants from the T1 evaluation were taken to the T2 round. The results of the T2 
evaluation are shown in Table 3 below. As shown, 2 of the lines showed a significant increase 
5 in seed weight for transgenic plants relative to the seed weight of corresponding non- 
transgenic plants. There was also an overall gene effect as concluded by the significant p 
value from the F-test of 0. 



Table 3 



10 



15 




Each row corresponds to one event, for which the total seed weight was determined for the 
transgenics (TR) and the nuli lines (null), expressed in units. The numeric difference between 
the positive plants and the negative plants is given (dH) as well as the percentage of difference 
between these plants (% dif). P-value stands for the probability produced by the t-test for each 
plant line. The last row presents the average numbers for all events. There, the p-value 
stands for the p-value derived from the F-test. 



4.2.3 Harvest index of plants 

The harvest index in the present invention is defined as the ratio between the total seed yield 
20 and the above ground area (mm 2 ), multiplied by a factor 106. Three of the best plants from 
the T1 evaluation were taken to the T2 round and the results of the T2 evaluation are 
shown in Table 4 below. As shown, 1 line showed an increased harvest index for transgenic 
plants relative to the harvest index of corresponding non-transgenic plants, with a p value from 
the t-test of 0. An overall gene effect was also evident with a p-value from the F-test of 0. 

25 

Table 4 



jHarvest Index 










JGne~~ 


™|TR~" 


|null 




j[%dif" 


p-value 


IpOlOA 
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[86.3 


;j9.47 




;0.3758 
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|j85.93_ 
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[194.6 

.. ..... ........ _„ L ^„ 
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ljO.9885 
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M .J 



26 



WO 2005/083094 



PCT/EP2005/050874 



Each row corresponds to one event for which the harvest index was determined for the 
transgenics (TR) and the null lines (null), expressed in units. The numeric difference between 
the positive plants and the negative plants is g/Ven (dif) as well as the percentage of difference 
5 between these plants (% dif). P-value stands for the probability produced by the t-test for each 
plant fine. The last row presents the average numbers for all events. There, the p-value 
stands for the p-value derived from the F-test 

4.2.4 Thousand Kernel Weight (TKW) 

10 This parameter is extrapolated from the number of filled seeds counted, and their total weight. 
Three of the best plants from the T1 evaluation were taken to the T2 round and the 
results of the T2 evaluation are shown in Table 5 below. As shown, one of the lines showed 
an increase in the TKW for transgenic plants relative to corresponding non-transgenic plants, 
with a p value from the t-test of 0.0455. 

15 

Table 5 



i TKW ....... .. 1 


IpLine 


TR [null 


fjdif |% d 


if j p-value 1 




j|25jj|24.6 


Jq.52l2~'~' 




!OS0934-0009A 


j 26.1 J24.8 




0.0455 | 


|OS0934-0007A 


j|26.5;j26.8 


if6.3 f-1 


J0J5609 | 


{Overall 


:|25T9j25.5 


;jo.39 j|2_ 


}0.2473 jj 



Each row corresponds to one event, for which the TKW was determined for the transgenics 
(TR) and the null lines (null), expressed in units. The numeric difference between the positive 
20 plants and the negative plants is given (dif) as well as the percentage of difference between 
these plants (% dif). P-value stands for the probability produced by the t-test for each plant 
line. The last row presents the average numbers for all events. There, the p-value stands for 
the p-value derived from the F-test 
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